Finite mixtures of multivariate skew t-distributions: some recent and new results
نویسندگان
چکیده
Finite mixtures of multivariate skew t (MST) distributions have proven to be useful in modelling heterogeneous data with asymmetric and heavy tail behaviour. Recently, they have been exploited as an effective tool for modelling flow cytometric data. A number of algorithms for the computation of the maximum likelihood (ML) estimates for the model parameters of mixtures of MST distributions have been put forward in recent years. These implementations use various characterizations of the MST distribution, which are similar but not identical. While exact implementation of the expectation-maximization (EM) algorithm can be achieved for ‘restricted’ characterizations of the component skew t-distributions, Monte Carlo (MC) methods have been used to fit the ‘unrestricted’ models. In this paper, we review several recent fitting algorithms for finite mixtures of multivariate skew t-distributions, at the same time clarifying some of the connections between the various existing proposals. In particular, recent results have shown that the EM algorithm can be implemented exactly for faster computation of ML estimates for mixtures with unrestricted MST components. The gain in computational time is effected by noting that the semi-infinite integrals on the E-step of the EM algorithm can be put in the form of moments of the truncated multivariate non-central t-distribution, similar to the restricted case, which subsequently can be expressed in terms of the non-truncated form of the central t-distribution function for which fast algorithms are available. We present comparisons to illustrate the relative performance of the restricted and unrestricted models, and demonstrate the usefulness of the recently G. J. McLachlan Department of Mathematics, University of Queensland, St Lucia, 4072, Australia E-mail: [email protected] proposed methodology for the unrestricted MST mixture, by some applications to three real datasets.
منابع مشابه
Rejoinder to the discussion of "Model-based clustering and classification with non-normal mixture distributions"
Non-normal mixture distributions have received increasing attention in recent years. Finite mixtures of multivariate skew-symmetric distributions, in particular, the skew normal and skew t-mixture models, are emerging as promising extensions to the traditional normal and t-mixture models. Most of these parametric families of skew distributions are closely related, and can be classified into fou...
متن کاملA Family of Skew-Slash Distributions and Estimation of its Parameters via an EM Algorithm
Abstract. In this paper, a family of skew-slash distributions is defined and investigated. We define the new family by the scale mixture of a skew-elliptically distributed random variable with the power of a uniform random variable. This family of distributions contains slash-elliptical and skew-slash distributions. We obtain the moments and some distributional properties of the new family of d...
متن کاملConstruction of multivariate distributions: a review of some recent results
The construction of multivariate distributions is an active field of research in theoretical and applied statistics. In this paper some recent developments in this field are reviewed. Specifically, we study and review the following set of methods: (a) Construction of multivariate distributions based on order statistics, (b) Methods based on mixtures, (c) Conditionally specified distributions, (...
متن کاملSkew-slash distribution and its application in topics regression
In many issues of statistical modeling, the common assumption is that observations are normally distributed. In many real data applications, however, the true distribution is deviated from the normal. Thus, the main concern of most recent studies on analyzing data is to construct and the use of alternative distributions. In this regard, new classes of distributions such as slash and skew-sla...
متن کاملDetermination of the number of components in finite mixture distribution with Skew-t-Normal components
Abstract One of the main goal in the mixture distributions is to determine the number of components. There are different methods for determination the number of components, for example, Greedy-EM algorithm which is based on adding a new component to the model until satisfied the best number of components. The second method is based on maximum entropy and finally the third method is based on non...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Statistics and Computing
دوره 24 شماره
صفحات -
تاریخ انتشار 2014